11 research outputs found

High-Precision Extraction of Emerging Concepts from Scientific Literature

Author: Devlin Jacob
Goodfellow Ian
He Xiangnan
Jo Yookyung
Mesbah Sepideh
Mihalcea Rada
Peters Matthew E
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 11/06/2020

Identification of new concepts in scientific literature can help power faceted search, scientific trend analysis, knowledge-base construction, and more, but current methods are lacking. Manual identification cannot keep up with the torrent of new publications, while the precision of existing automatic techniques is too low for many applications. We present an unsupervised concept extraction method for scientific literature that achieves much higher precision than previous work. Our approach relies on a simple but novel intuition: each scientific concept is likely to be introduced or popularized by a single paper that is disproportionately cited by subsequent papers mentioning the concept. From a corpus of computer science papers on arXiv, we find that our method achieves a Precision@1000 of 99%, compared to 86% for prior work, and a substantially better precision-yield trade-off across the top 15,000 extractions. To stimulate research in this area, we release our code and data (https://github.com/allenai/ForeCite).
Comment: Accepted to SIGIR 202
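The intuition in the abstract above can be illustrated with a toy sketch. It is not the scoring function used in the released ForeCite code; the function names, the input layout (`term_mentions`, `citations`), and the simple "share of mentioning papers that cite one paper" score are all assumptions made for illustration. A Precision@k helper is included because the abstract reports results in that metric.

```python
from collections import defaultdict


def citation_concentration(term_mentions, citations):
    """For each term, find the paper cited by the largest share of papers mentioning it.

    term_mentions: dict mapping term -> set of ids of papers that mention the term
    citations:     dict mapping paper id -> set of paper ids that it cites
    Returns a dict mapping term -> (most_cited_paper_id, share_of_mentioning_papers).
    Hypothetical toy score, not the formula from the paper.
    """
    result = {}
    for term, papers in term_mentions.items():
        counts = defaultdict(int)
        for paper in papers:
            for cited in citations.get(paper, set()):
                counts[cited] += 1
        if not counts:
            result[term] = (None, 0.0)
            continue
        # Sort first so that ties are broken deterministically by paper id.
        best = max(sorted(counts), key=lambda c: counts[c])
        result[term] = (best, counts[best] / len(papers))
    return result


def precision_at_k(ranked_terms, gold_concepts, k):
    """Fraction of the top-k ranked terms that are judged to be true concepts."""
    top = ranked_terms[:k]
    if not top:
        return 0.0
    return sum(1 for t in top if t in gold_concepts) / len(top)


# Tiny made-up corpus: every paper mentioning "bert" cites p1,
# while the generic term "model" is spread over more papers.
term_mentions = {
    "bert": {"p2", "p3", "p4"},
    "model": {"p2", "p3", "p4", "p5"},
}
citations = {"p2": {"p1"}, "p3": {"p1"}, "p4": {"p1", "p0"}, "p5": {"p9"}}

scores = citation_concentration(term_mentions, citations)
ranking = sorted(scores, key=lambda t: scores[t][1], reverse=True)
```

In this toy data the specific term ranks above the generic one because all of its mentioning papers cite the same source, which is the pattern the abstract describes.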

arXiv.org e-Print Archive

Crossref

Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019

Author: Ahmad Sakor
Alba Catalina Morales Tirado
Alessandro Umbrico
Allard Oelen
Amine Dadoun
Aneta Koleva
Anna Nguyen
Ariam Rivas Mendez
Axel Polleres
Bilal Koteich
Chang Sun
Chuangtao Ma
Claudia d'Amato
Eleonora Marzi
Fabio Mariani
Federico Igne
Felix Bensmann
Frances Gillis-Webber
Francesca Alloatti
Francesca Giovannetti
Genet Asefa Gesese
Gianmarco Spinaci
Glenda Amaral
Harald Sack
Harm Delva
Heiko Paulheim
Irene Celino
Ismail Harrando
Ivan Heibi
Jaime Salas
Jan Portisch
John Domingue
Kabul Kurniawan
Kader Pustu-Iren
Kholoud Alghamdi
Laurine Huber
Lientje Maas
Ling Cai
Luigi Asprino
Maheshkumar Mistry
Marc Gallofré Ocaña
Margherita Porena
Marieke van Erp
Martin Beno
Martin Mansfield
Marìa Granados Buey
Meilin Shi
Mengya Liu
Michalis Georgiou
Michel Dumontier
Mohamad Yaser Jaradeh
Molka Tounsi Dhouib
Mortaza Alinam
Nacira Abbas
Neha Keshan
Omaima Fallatah
Paola Espinoza Arias
Riley Capshaw
Russa Biswas
Sebastian Rudolph
Sebastián Ferrada
Sepideh Mesbah
Soheil Roshankish
Stefano De Giorgis
Tabea Tietz
Thomas Schleider
Valentina Anita Carriero
Valentina Pasqual
Valentina Presutti
Viet Bach Nguyen
Vincent Emonet
Vitor Horta
Weiqin Xu
Wouter van den Berg
Publication date: 01/01/2020

One of the grand challenges discussed during the Dagstuhl Seminar "Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web" and described in its report is that of a: "Public FAIR Knowledge Graph of Everything: We increasingly see the creation of knowledge graphs that capture information about the entirety of a class of entities. [...] This grand challenge extends this further by asking if we can create a knowledge graph of "everything" ranging from common sense concepts to location based entities. This knowledge graph should be "open to the public" in a FAIR manner democratizing this mass amount of knowledge." Although linked open data (LOD) is one knowledge graph, it is the closest realisation (and probably the only one) to a public FAIR Knowledge Graph (KG) of everything. Surely, LOD provides a unique testbed for experimenting and evaluating research hypotheses on open and FAIR KG. One of the most neglected FAIR issues about KGs is their ongoing evolution and long term preservation. We want to investigate this problem, that is to understand what preserving and supporting the evolution of KGs means and how these problems can be addressed. Clearly, the problem can be approached from different perspectives and may require the development of different approaches, including new theories, ontologies, metrics, strategies, procedures, etc. This document reports a collaborative effort performed by 9 teams of students, each guided by a senior researcher as their mentor, attending the International Semantic Web Research School (ISWS 2019). Each team provides a different perspective to the problem of knowledge graph evolution substantiated by a set of research questions as the main subject of their investigation. In addition, they provide their working definition for KG preservation and evolution

Archivio istituzionale della ricerca - Università di Bari

HybridEval: A Human-AI Collaborative Approach for Evaluating Design Ideas at Scale

Author: Arous Ines
Bozzon A.
Mesbah Sepideh
Yang J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2023

Evaluating design ideas is necessary to predict their success and assess their impact early on in the process. Existing methods rely either on metrics computed by systems that are effective but subject to errors and bias, or experts' ratings, which are accurate but expensive and long to collect. Crowdsourcing offers a compelling way to evaluate a large number of design ideas in a short amount of time while being cost-effective. Workers' evaluation is, however, less reliable and might substantially differ from experts' evaluation. In this work, we investigate workers' rating behavior and compare it with experts. First, we instrument a crowdsourcing study where we asked workers to evaluate design ideas from three innovation challenges. We show that workers share similar insights with experts but tend to rate more generously and weigh certain criteria more importantly. Next, we develop a hybrid human-AI approach that combines a machine learning model with crowdsourcing to evaluate ideas. Our approach models workers' reliability and bias while leveraging ideas' textual content to train a machine learning model. It is able to incorporate experts' ratings whenever available, to supervise the model training and infer worker performance. Results show that our framework outperforms baseline methods and requires significantly less training data from experts, thus providing a viable solution for evaluating ideas at scale.

TU Delft Repository

Second Workshop on Recommender Systems for Human Resources (RecSys in HR 2022)

Author: Bogers Toine
Graus David
Gutiérrez Francisco
Johnson Chris
Kaya Mesut
Mesbah Sepideh
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 12/09/2022

VBN

End-to-End Bias Mitigation in Candidate Recommender Systems with Fairness Gates

Author: Arafan Adam Mehdi
Beauxis-Aussalet Emma
Bogers Toine
Graus David
Gutiérrez Francisco
Johnson Chris
Kaya Mesut
Mesbah Sepideh
Santos Fernando P.
Publication venue: CEUR-WS
Publication date: 19/09/2022

Recommender Systems (RS) have proven successful in a wide variety of domains, and the human resources (HR) domain is no exception. RS proved valuable for recommending candidates for a position, although the ethical implications have recently been identified as high-risk by the European Commission. In this study, we apply RS to match candidates with job requests. The RS pipeline includes two fairness gates at two different steps: pre-processing (using GAN-based synthetic candidate generation) and post-processing (with greedily searched candidate re-ranking). While prior research studied fairness at pre- and post-processing steps separately, our approach combines them both in the same pipeline applicable to the HR domain. We show that the combination of gender-balanced synthetic training data with pair re-ranking increased fairness with satisfactory levels of ranking utility. Our findings show that using only the gender-balanced synthetic data for bias mitigation is fairer by a negligible margin when compared to using real data. However, when implemented together with the pair re-ranker, candidate recommendation fairness improved considerably, while maintaining a satisfactory utility score. In contrast, using only the pair re-ranker achieved a similar fairness level, but had a consistently lower utility
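As a rough illustration of what a greedy post-processing re-ranker can look like, the sketch below enforces a minimum share of a protected group at every prefix of the ranking. It is a generic technique chosen for illustration, not the pair re-ranker evaluated in the paper; the function name, the `(id, score, group)` tuple layout, and the `min_share` parameter are all hypothetical.

```python
import math


def greedy_fair_rerank(candidates, k, protected, min_share):
    """Greedily build a top-k ranking with a minimum protected-group share per prefix.

    candidates: list of (candidate_id, score, group) tuples
    protected:  group label whose representation is being protected
    min_share:  required fraction of protected candidates in every prefix,
                rounded down (floor) to a whole number of candidates
    Hypothetical sketch; not the re-ranking method from the paper.
    """
    remaining = sorted(candidates, key=lambda c: c[1], reverse=True)
    ranked = []
    n_protected = 0
    while remaining and len(ranked) < k:
        position = len(ranked) + 1
        required = math.floor(min_share * position)
        pick = None
        if n_protected < required:
            # Constraint would be violated: take the best remaining protected candidate.
            pick = next((c for c in remaining if c[2] == protected), None)
        if pick is None:
            # Constraint satisfied (or no protected candidates left): take the best overall.
            pick = remaining[0]
        remaining.remove(pick)
        ranked.append(pick)
        if pick[2] == protected:
            n_protected += 1
    return ranked


# Made-up candidates: scores favour group "m", so an unconstrained top-4 would be a, b, c, d.
candidates = [
    ("a", 0.9, "m"),
    ("b", 0.8, "m"),
    ("c", 0.7, "f"),
    ("d", 0.6, "m"),
    ("e", 0.5, "f"),
]
reranked_ids = [c[0] for c in greedy_fair_rerank(candidates, 4, "f", 0.5)]
```

The trade-off mentioned in the abstract is visible even here: candidate "e" displaces the higher-scoring "d", which raises representation at some cost in ranking utility.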

Reddit dataset for Adverse Drug Reaction

Author: Bozzon A. (Alessandro)
Houben G.J. (Geert-Jan)
Lofi C. (Christoph)
Mesbah S. (Sepideh)
Sips R.J. (Robert-Jan)
Valle Torre M. (Manuel)
Yang J. (Jie)
Publication venue: 4TU.Centre for Research Data

Reddit dataset for Adverse Drug Reaction (ADR) detection which was created with the help of expert annotators

Neuron-Miner: An Advanced Tool for Morphological Search and Retrieval in Neuroscientific Image Databases

Author: Amin Katouzian
B Desgraupes
GA Ascoli
H Hotelling
J Wang
LDF Costa
M Slaney
MG Kendall
Mohammadreza Negahdar
Nassir Navab
Philipp L. Rautenberg
PL Rautenberg
R Scorcioni
S Polavaram
Sailesh Conjeti
Sepideh Mesbah
Shaoting Zhang
Y Wan
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Knowledge Graphs Evolution and Preservation: A Technical Report from ISWS 2019

Author: Abbas Nacira
Alghamdi Kholoud
Alinam Mortaza
Alloatti Francesca
Amaral Glenda
Arias Paola Espinoza
Asprino Luigi
Beno Martin
Bensmann Felix
Berg Wouter van den
Biswas Russa
Buey Marìa Granados
Cai Ling
Capshaw Riley
Carriero Valentina Anita
Celino Irene
d'Amato Claudia
Dadoun Amine
Delva Harm
Dhouib Molka Tounsi
Domingue John
Dumontier Michel
Emonet Vincent
Erp Marieke van
Fallatah Omaima
Ferrada Sebastián
Georgiou Michalis
Gesese Genet Asefa
Gillis-Webber Frances
Giorgis Stefano De
Giovannetti Francesca
Harrando Ismail
Heibi Ivan
Horta Vitor
Huber Laurine
Igne Federico
Jaradeh Mohamad Yaser
Keshan Neha
Koleva Aneta
Koteich Bilal
Kurniawan Kabul
Liu Mengya
Ma Chuangtao
Maas Lientje
Mansfield Martin
Mariani Fabio
Marzi Eleonora
Mendez Ariam Rivas
Mesbah Sepideh
Mistry Maheshkumar
Nguyen Anna
Nguyen Viet Bach
Ocaña Marc Gallofré
Oelen Allard
Pasqual Valentina
Paulheim Heiko
Polleres Axel
Porena Margherita
Portisch Jan
Presutti Valentina
Pustu-Iren Kader
Roshankish Soheil
Rudolph Sebastian
Sack Harald
Sakor Ahmad
Salas Jaime
Schleider Thomas
Shi Meilin
Spinaci Gianmarco
Sun Chang
Tietz Tabea
Tirado Alba Catalina Morales
Umbrico Alessandro
Xu Weiqin
Publication venue: Arxiv.org
Publication date: 01/01/2020

Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019

Author: Abbas Nacira
Alghamdi Kholoud
Alinam Mortaza
Alloatti Francesca
Amaral Glenda
Arias Paola Espinoza
Asprino Luigi
Beno Martin
Bensmann Felix
Berg Wouter van den
Biswas Russa
Buey Marìa Granados
Cai Ling
Capshaw Riley
Carriero Valentina Anita
Celino Irene
d'Amato Claudia
Dadoun Amine
De Giorgis Stefano
Delva Harm
Dhouib Molka
Domingue John
Dumontier Michel
Emonet Vincent
Fallatah Omaima
Ferrada Sebastián
Georgiou Michalis
Gesese Genet Asefa
Gillis-Webber Frances
Giovannetti Francesca
Harrando Ismail
Heibi Ivan
Horta Vitor
Huber Laurine
Igne Federico
Jaradeh Mohamad Yaser
Keshan Neha
Koleva Aneta
Koteich Bilal
Kurniawan Kabul
Liu Mengya
Ma Chuangtao
Maas Lientje
Mansfield Martin
Mariani Fabio
Marzi Eleonora
Mendez Ariam Rivas
Mesbah Sepideh
Mistry Maheshkumar
Nguyen Anna
Nguyen Viet Bach
Ocaña Marc Gallofré
Oelen Allard
Pasqual Valentina
Paulheim Heiko
Polleres Axel
Porena Margherita
Portisch Jan
Presutti Valentina
Pustu-Iren Kader
Roshankish Soheil
Rudolph Sebastian
Sack Harald
Sakor Ahmad
Salas Jaime
Schleider Thomas
Shi Meilin
Spinaci Gianmarco
Sun Chang
Tietz Tabea
Tirado Alba Catalina Morales
Umbrico Alessandro
Van Erp Marieke
Xu Weiqin
Publication venue: HAL CCSD
Publication date: 15/01/2021

HAL Descartes